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ABSTRACT 

Using ^300,000 photometrically classified quasars, by far the largest quasar sample ever used for such 
analyses, we study the redshift and luminosity evolution of quasar clustering on scales of ~50 h^ 1 kpc 
to ^20 hr 1 Mpc from redshifts of z ^0.75 to z ^2.28. We parameterize our clustering amplitudes 
using realistic dark matter models, and find that a ACDM power spectrum provides a superb fit to our 
data with a redshift-averaged quasar bias of b^ 1A0 = 2.41 ± 0.08 (P <x 2 = 0.847) for cr 8 = 0.9. This 

represents a better fit than the best-fit power-law model (w = 0.0493 ± O.OO640-°- 928±0 055 ; P <x 2 = 
0.482). We find 6q increases with redshift. This evolution is significant at > 99.6% using our data set 
alone, increasing to > 99.9999% if stellar contamination is not explicitly parameterized. We measure 
the quasar classification efficiency across our full sample as a — 95.6±f;|%, a star -quasar separation 
comparable with the star-galaxy separation in many photometric studies of galaxy clustering. We 
derive the mean mass of the dark matter halos hosting quasars as Momh = 5.2 ± 0.6 x 10 12 h -1 M0. 
At z ~ 1.9 we find a 1.5a deviation from luminosity-independent quasar clustering; this suggests that 
increasing our sample size by a factor of ~1.8 could begin to constrain any luminosity dependence 
in quasar bias at z ~ 2. Our results agree with recent studies of quasar environments at z < 0.4, 
which detected little luminosity dependence to quasar clustering on proper scales > 50 h~ x kpc. At 
z < 1.6, our analysis suggests that 6q is constant with luminosity to within A6q ~ 0.6, and that, for 
g < 21, angular quasar autocorrelation measurements are unlikely to have sufficient statistical power 
at z < 1.6 to detect any luminosity dependence in quasars' clustering. 

Subject headings: cosmology: observations 
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1. INTRODUCTION 

As the form of the nonbaryonic, cold dark mat- 
ter that underpins mass in the cosmo s becomes 
incre asingly accurately described (e.g., I Cole et all 
2005), understanding the baryonic processes that 
fuel quasars and trigger galaxy formation becomes 
an increasingly realistic endeavor. It is now es- 
tablished that most, if not all, local galaxies har- 
bor a supermassive black ho le (|Kormendv fc Richstonel 
119951 : iRichstone et al.1 119981 ) and that the mass of 
these black holes correlate with several proper- 
ties of their host galaxy's bulge (iMagorrian et al.1 
1998t iFerrarese fc Merrittl [20001 iGebhardt et alj 1200"' 



Graham et alJl2002t iTremaine et alJl2002t lWvithdl200 
implying a causal link between black holes and sta r for- 
mation in galaxy spheroids (e.g.. lSilk fc Reed [T998l Ob- 
servational evidence is accumulating to suggest that such 
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a cau sal link remains at higher redshift (IShields et al.l 
[2003h . 

It has long been suspected that accretion of baryons 
onto supermassive black holes is responsible for the 
powerful U V-excess (U VX) emission seen in quasars 
(see, e.g., iReesI 119841 for a review), so the role of 
supermassive black holes in galaxy formation suggests 
an interplay between nascent galaxies and quasar 
activity. A symbiotic view of galaxy formation has 
emerged, in which galaxy mergers drive the formation 
of quasars, and superma ssive black holes, whi c h in turn 
seed n ew galaxies (e.g.. iHeckman et ail 119861: ICarlbergl 
1990t iBarnes fc Hernquist! Il992t lLacev fc Cold 119931 : 



Pi Matteo. Springel fc Hernauisffl2005[) . Given that only 
merging systems of a certain minimum mass can trigger 
a UVX quasar phase visible aga inst background sta r 
formation in the host galaxy (e.g.. iHopkins et aT]|2006h . 
this picture is consistent with emerging evidence that 
at z < 2.5 quasar bias evolves with redshift but that 
quasars inhabit dark matter halos of similar average mass 
at every redshift dPorciani. Magliocchetti fc Norberd 
120041 : ICroom et all 120051 : iMvers et alJ l2006f >. However, 
the simplicity of this picture belies a rich complexity in 
the important physical processes that entwine quasar 
galaxy and star formation (see Hop kins et al.l 120061 
for a review). Some of the many theoretical insights 
into this complexity have included the importance of 
galaxy mergers (e.g.. pToomre fc Toomrd Il972t Iw hitc 



19791: Ncgropo ntefc Whitd 119821 : IBarnes fc Hernquist 
19921 : iFranchesini et all 1 1999T > . cooling flows (e.g 
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ICiotti fc Ostrikerl 119971 . 12001( 1. he ating throug h var- 
ious feedback mechan is ms (e.g.. ISilk fc Reesl 119981 : 
iWvithe fc Loebl 120021 iSpringel fc Hernauistl l200l 
and/or the eventual cutoff of accretion onto a central 
black hole by gas ejection (e.g., ISilk fc Ree s 1998; Fabianl 
ll999HSazanov et al.ll2005T ). 

Clearly, quasar evolution is an important tracer of 
galaxy formation; however, the large number of compo- 
nents that help regulate models of quasar activity beg 
new constraints. Measurements of quasar clustering am- 
plitudes, which directly correlate with the average mass 
of the halos that harbor quasars, have provided useful 
broad constraints on gravitationally driven aspects of 
quasar evolution. However, the relevance of gas physics 
to quasar evolution means that measurements of the lu- 
minosity function of quasars (see Richards ct al. 2006 for 
a review) have also proved key in constraining baryonic 
elements of qu asar evolution, e.g., quasar lifetimes via the 
"duty c ycle" (IHaiman fc Hui 2001 IMartini fc Weinberg] 
1200 il l. iHopkins et alj (|2005bD have suggested that the 
peak luminosity distribution of quasars is fundamental 
to characterizing the quasar population; this suggests 
that important observational constraints will emerge by 
considering baryons and gravity in tandem, by measur- 
ing the luminosity evolution of quas ar clustering (e.g., 
IValageas etalll200lHLidz et al.ll2006t l. 

Since the first significant detections of quasar 
clustering, the t wo-po i nt correlation function (e.g., 
iTotsuii fc Kiharal 119691 : iPeeblesI 11980( 1 has frequently 
been used to measure the amplitude of quasar clustering, 
and accuracy has improved in step with sample sizes. Re- 
cent detections, in the wake of large spectroscopic quasar 
surveys, have led to some confidence about th e evolution 
of quasar clustering (e.g.. lCroom et al1l2005l : henceforth 
Cro05), constraining a lack of evolution in the dark mat- 
ter halos that host quasars to ^50%. However, the de- 
pendence of quasar cluste ring on luminosi ty appears to 
be quite weak (e.g., Cro05: lLidz et al.ll2006f) . and probing 
variations in quasar clustering as a function of luminos- 
ity or other physical properties, as well as more tightly 
constraining evolution in quasar clustering, is limited by 
quasar sample sizes. 

Large samples of photometrically selected objects have 
long been used to p robe angular galaxy clustering 
(jGroth fc P eebles 1973), leading to impor tant cosmolog- 
ical co nstraints such as early detections (Maddox et al. 
Il990al ) of deviations from Standard Cold Dark Matter 
(CDM) models or ea rly detections of Dark Energy (e.g., 
IScranton et al. 1 120031 ). Such angular analyses were com- 
plimentary to analyses using spectroscopic data because 
of the larger numbers of galaxies that could be photo- 
metrically selected. Similar angular clustering analyses 
of complimentarily large samples of photometrically se- 
lected quasars were impossible, however; star-galaxy sep- 
aration became efficient in galaxy surveys with the ad- 
vent of automatic plate measurements (e.g., 90 -95% effi- 
cient in the APM survey: iMaddox et aDl990H l but star- 
quasar separation lagged behi nd (e.g. ,^60% effic ient in 
the 2dF QSO Redshift Survey: ICroom et al.ll200l hence- 
forth 2QZ). With the recent adven t of sophisticated pho- 
tomctric classification of quasars ((Richards et alJ 12004( 1 
star-quasar sep aration at many redshifts is now highl y 
efficient (~95%: lRichards et al.ll200llMvers et al.ll2006D . 
meaning that large quasar samples can be used to mea- 



sure angul ar quasar clusterin g as a function of physical 
properties (jMvers et al.ll2006( l. and to use angular quasar 
clustering to prob e Dark Matter (IScranton et alj I2005T ) 
and Dark Energy (IGiannantonio et al.ll2006f T 

In IMvers et all ((20061 : henceforth Mve06). we pre- 
sented a first, proof-of-concept, analysis of the cluster- 
ing of ^80,000 photometrically classified quasars. In 
this series of papers, we extend this work, improving 
our modeling techniques and presenting measurements 
of the two-point correlation function of ^300,000 photo- 
metrically classified quasars drawn from the fourth data 
rele ase (DR4) of the Sloan Digital Sky Survey (SPSS 
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e.g..|Stoughton et all 120021 lAbazaiian et al.ll2003L 12004 
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2005). Our goal in this paper is to study the dependence 
of quasar clustering on redshift and luminosity, focusing 
on li near and quasi-lin ear scales. In a companion pa- 
per (jMvers et al.l l2007t henceforth Paper2), we analyze 
quasar clustering on smaller scales. 

Extensive details of our techniques, modeling, and sys- 
tematics are presented in Appendixes A and B, allow- 
ing our main analysis to be presented in Section [31 af- 
ter detailing our data sample in Section [2] Our main 
results are ordered in a concluding section (Section [5]). 
Unless otherwise specified, we assume a ACDM cosmol- 
ogy with (O m , Qa s cts, T,h = ffo/lOOkm s" 1 Mpc" 1 ^ 
(0.3, 0.7, 0.9, 0.21, 0.7), where T is the shape of the mat- 
ter power spectrum (r = Q m h for baryon-free CDM). 
We correct all m agnitudes for Galactic extinct i on us ing 
the dust maps of lSchlegel. Finkbeiner fc Davis! (j!998l ). 

2. THE DR4 KDE SAMPLE 

The quasar sample that we analyze is constructed 
us ing the Kernel Densit y Estimation (KDE) technique 
of Richar ds et al.l (|2004f ). which draw s on many uniqu e 
technical aspects of the SDSS (e.g., lYork et al.l l2000h 
inclu d ing superior pho t ometry (e.g., iFukugita et al.l 



19961: iGunn et alJ Il998t iLupton. Gunn fc Szalavl 119991 : 
Hogg et all 12001b I Smith et al 1 120021 : llvezic et alj \2004) 



astrometr y (e.g., iPier et alj 120031), and data acquisi- 
tion (e.g.. IGunn et al.| [200l iTucker et al.ll2006l). As in 
Rich ards et all (|2004) , the sample is restricted to SDSS 
point sources with u — g < 1, (observed) g > 14.5 and 
(dereddened) g < 21. Separations, in 4-dimensional 
color-space, from a sample o f ~10% of point sourc es in 
SDSS Data Release 1 (D R1: lAbazaiian et al.ll2003l) and 
from the quasar sample of ISchneider et al.l ((20031 : hence- 
forth DR1QSO) are determined for each object to be 
classified. A Bayesian classifier then assigns each object 
a probability of being a "quasar" or "star" . Taken in 
logarithmic ratio, the distribution of these probabilities 
is sufficiently bimodal to s eparate z < 2.5 quas ars from 
stars with ~95% efficiency ((Richards et al.ll200~4l see also 
Mye06). Applying the KDE technique to SDSS DR4, 
results in our DR4 KDE sample of 344,431 objects 8 , a 
sample 3.5 x larger than the DR1 sample used in Mye06. 
Each KDE object is a ssigned a photometric redshift es- 
timate as described in IWeinstein et aT] ((2004( 1 . 

To meaningfully model quasar clustering we must 
know the normalized redshift distribution of our sources 
(AN/dz in Equation IB2jl . For consistency with Mye06 
(see their Figure 6), we estimate dN/dz from spectro- 
scopic matches to DR1QSO, matches to spectra taken 

8 Available at http://sdss.ncsa.uiuc.edu/qso/nbckde 
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Fig. 1.— The autocorrelation of all 299,276 (A g < 0.21) DR4 
KDE objects in our working area. Errors are jackknifed with a 
resolution of 10°. The short-dashed line is the best fitting bias 
model (Equation IB2D over the range 0.16' to 63' (~55 h^ 1 kpc 
to 22 h~ x Mpc at the DR4 KDE sample's mean redshift of z ~ 
1.4). The dotted line is the best fitting bias model (over the same 
range) wh en fi tting for stellar contamination in the KDE sample 
(Equation IC1I with loss = 0.18). The long-dashed line is the best 
fitting bias model with stellar contamination when the fitting range 
is extended out to 100' (~35 h~ 1 Mpc). The solid line shows the 
expected clustering for a linear bias model, which is easily rejected 
at extremely high significance. The lower panel shows the measured 
quasar bias relative to the linear bias model. The dot-dashed line 
is the best-fit power-law over 0.16' to 63'. We estimate dN/dz 
for the redshift distribution of the full DR4 KDE sample using a 
simple spline fit. 

from the SDSS second data release (|Abazaiian et al.l 
12004 henceforth DR2) or matches to the 2QZ. As our 
KDE technique is currently trained on DR1QSO, obtain- 
ing dN/dz from DR1QSO is arguably a fairer approach 
than using quasars from later data releases. Mye06 
demonstrated that including or excluding matches with 
the 2QZ or DR2 has little affect on the form of dN/dz, 
and further showed that the methodology of using spec- 
troscopic matches is broadly consiste nt with estimat- 
ing d N/dz from photometric redshifts (fWeinstci n et al.l 
l200l . 

3. QUASAR CLUSTERING RESULTS 
3.1. Mean Quasar Bias at z ~ 1 .4 

In Figure □ we show the (A g < 0.21) DR4 KDE auto- 
correlation. We fit bias models (see Equation IB2[) over 
scales of 0.16' to 63' (~55 h' 1 kpc to 22 h' 1 Mpc at the 
DR4 KDE sample's mean redshift of z = 1.4). The fit's 
upper scale limit is nominally set by stellar contamina- 
tion (see Appendix [Cj and the lower limit is set by the 
dark matter model we use for ujqq. Smi03 note that their 
models accurately reproduce A^ L to the limits of current 
simulations (~3%) at wavcnumbers of k < 10 h Mpc -1 
(> 1'); however, their models appear quite accurate even 
on scales several times smaller than this (see, e.g., Fig- 
ure 15 of Smi03) and, in any case, their models remain 
useful as a phenomenological description of dark mat- 
ter clustering on all our scales of interest. In particu- 
lar, any models that augment the approach of Smi03 at 
k < 10 h Mpc^ 1 should be easy to compare to Smi03, 



and thus to our results. 

Our best fit bias model to the DR4 KDE autocor- 
relation has b Q = 1A0 = 2.41 ± 0.08 (P <x 2 = 0.847), in 
good agreement with Mye06 (b Q =1A0 = 2.51 ± 0.46) 
but with considerably more precision. This is pre- 
ferred over the best-fit power-law model (u = 0.0493 ± 
0.00646»-°- 928±0 055 ; P <x 2 = 0.482). A linear bias model 
(see Figure [I]), is ruled out at an extremely high level of 
significance (P< x 2 ~ 10~ 23 ). Our measured 6q is in rea- 
sonable agree ment with the value of b Z Q 1A1 = 2. 42±q; 2 ,° 
obtained by iPorcianiTl Vlagiiocc hetti fc Norbergl (2004; 
henceforth PMN04) in a clustering analysis of 14,000 
M bj < -22.5 2QZ quasars, even though PMN04 use 
(78 = 0.8, which will inflate their result by ~ 13% as com- 
pared to our use of erg = 0.9. Our result is slightly at odds 
with the value of b Q = 135 = 2.02 ± 0.07 found by Cro05 
for the full 2QZ sample after correcting their result for 
rcdshift-space distortions. It is, however, well within the 
error bars of their empirical fit to the evolution of quasar 
clustering of b Q = (0.53 ± 0.19) + (0.289 ± 0.035)(1 + zf. 
The form of this empirical fit broadly implies that our re- 
sult (at z = 1.40) should be ~ 5% lower than an estimate 
at z = 1.47 and ~ 3% higher than at z = 1.35. 

To test stellar contamination effects, we also fit a model 
of the form a 2 ujqq + (1 — a) 2 uJss, where a is the KDE 
efficiency and ujss — 0.18 (as derived in Mye06) and find 
b Q = 2.48 ±0.15 and a = 0.956=bg;gf| (P <x 2 = 0.920). 
As we previously argued in Mye06, stellar contamination 
causes measurable deviation from the true ujqq only on 
scales of a degree or more. An appropriate approach 
would be to determine a at > 1° and input this value 
to fits on scales < 1°. If a > 0.9, this largely becomes 
unnecessary, as introducing an efficiency term does not 
significantly alter measured values of &q at < 1°. 

In Figure[T] we also fit a stellar contamination model to 
larger scales (where stellar contamination begins to dom- 
inate fits) as such a model should still be valid on these 
scales. When a stellar contamination model is fit out to 
100', we obtain b Q = 2.47 ± 0.15 and a = 0.970±g;gf? 
(P <x 2 = 0.908). This value of a reproduces the data 
very well out to at least 7° (see Figure [T}; however, al- 
though altering the scale of the fit affects estimates of 
a, 6q is essentially unchanged, as stellar contamination 
only dominates on large scales. We further note that al- 
tering ujss to 0.25, as is appropriate at ~30' (see Mye06) 
barely changes our result, giving 6q = 2.47 ± 0.14 and 
a = 0.974±^ 2 | (P <X 2 = 0.936); again, simply because 
stellar contamination is only influential on scales larger 
than those we typically fit. In general, throughout this 
paper, we will quote results for models that simultane- 
ously fit for a and 6q. However, in most cases fitting for 
a at < 1°, although illustrative, is overkill, and falsely re- 
duces the significance of our 6q measurements. We will 
therefore trust the significance estimates of those fits that 
ignore stellar contamination. 

3.2. Evolution in Quasar Bias 

The spectroscopic redshift distribution of quasars at a 
given photometric redshift can become increasingly com- 
plex as the photometric redshift bandwidth is reduced. 
Therefore, to model the evolution of quasar clustering in 
a number of photometric redshift bins, it is convenient 
to have a better mechanism for modeling dN/dz than 
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Fig. 2. — The spectroscopic redshift (z sp ectro) distribution of 
KDE quasars for some photometric redshift bins (z p hot) used in our 
analyses. The dotted lines are functional fits of the form described 
by Equation [TJ The plotted distributions have been normalized 
and the integral under the fitted functions is always within 0.1% 
of unity 

the simple spline fit used in Section 13.11 Typically, the 
photometric redshift distributions we study have a pri- 
mary peak where the photometric solution agrees with 
the spectroscopy and, in some cases, a minor, secondary 
peak where the photometric solution is inaccurate (due 
to so-called catastrophic failures). We therefore adopt 
the approach of summing a number of functions of the 
form 

dN = 7a exp ~ |z ~„ Zi| - dz (1) 
i 

where n (typically close to the Gaussian value of 2) , a, z 
and (3 are free parameters. These functions can excel- 
lently reproduce dN/dz (see Figure HJ. Our schema for 
binning in photometric redshift is chosen to maximize 
object numbers in each bin while limiting discrepancies 
between the photometric and spectroscopic redshift esti- 
mates (see Mye06). 

In Figure[3]we show the evolution of the angular quasar 
autocorrelation with photometric redshift. We derive es- 
timates of the quasar bias using Equation IB21 and also 
consider a two-parameter stellar-contamination model 
(Equation IC1[) . In Figure 0] we display our measured 
quasar bias evolution. Our data alone (i.e., without as- 
suming, 6q ~ 1 at z = 0), rules out constant &q at all 
redshifts at > 99.99%, dropping to > 97.9% if stellar con- 
tamination is allowed to freely vary. As discussed in Sec- 
tion ^. 1[ not fitting for stellar contamination likely better 
estimates significances. Using 2QZ data, PMN04 and 
Cro05 have independently determined that 6q evolves 
with redshift. We note that, after correcting for differing 
erg, our value of ^''°*- 1 - 87 disagrees with &g =189 from 
PMN04 but only at the 1.8<7 level, dropping to 0.8cr if 
stellar contamination is incorporated. The Mbj < —22.5 
restriction adopted by PMN04 is immaterial in this con- 
text, as every g < 21 quasar at z > 1.7 should be brighter 
than Mbj — —22.5. Bias determinations derived from 
cosmological models that are more like those used by 



Fig. 3. — Quasar clustering evolution as a function of photometric 
redshift. There are ~65,000 quasars in each bin except for the 2.1 < 
z phot < 2.8 bin, which contains ~28,000 quasars. The solid line is 
the expected clustering of dark matter derived from Smi03. The 
dotted line is our best fit model where only the quasar bias, 6q, is 
varied. The dashed line is a two-parameter model that incorporates 
stellar contamination as well as quasar bias (see Table fT] for model 
values). Errors in this plot are jackknifed and fits are made over 
scales of 0.16' to 63' using a full covariance matrix. A scale of 0.16' 
to 63' is ~55 h~ 1 kpc to 22 h~ 1 Mpc in all bins except the lowest 
redshift bin, where the scales are slightly reduced to ~50 h^ 1 kpc 
to 20 h' 1 Mpc. 
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Fig. 4. — Our estimates of the evolution of quasar bias compared 
to other authors, and to our estimates from Mye06. The solid 
line plots the semi-empirical relationship 6q(z) = 0.53 + 0.289(1 + 
z) 2 derived by Cro05, and the dotted lines track the la error. 
The open triangles show our best fit model where only the quasar 
bias, 6q , is varied. The solid triangles show the results when the 
stellar contaminatio n, is also allowed to va ry (see Table[T]for model 
values). PMN04 and lGrazian et al., (!2O0|j) use as = 0.8, so we also 
show the effect of projec ting their results to match our chosen value 
of as = 0.9. In Section 13. II we note that, due to the photometric 
nature of the redshifts we use, our points at the lowest and highest 
redshift are almost certainly at least 10% too high and 10% too 
low, respectively. Points at other redshifts are not at all biased by 
the use of photometric redshifts. 
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PMN04 and Cro05 are provided in Table [T] (see also the 
discussion in Section l4~Tj) . 

The photometric nature of our redshifts leads to prob- 
lems for our Zphot = 0.75 and z p hot = 2.28 bins (i.e., our 
lowest and highest redshift bins); in particular, where 
to plot these bins on the z p hot axis of Figure HI At 
1 < Zphot < 2, the mean redshift of our spectroscopic 
matches is close to Zphot) but beyond this range the 
true redshift is further from the photometric estimate, 
due to catastrophic failures in the photometric redshift 
estimation. Figure [2] demonstrates that at z p hot < 1 
{z p hot > 2) there is a secondary solution, amounting to 
12.6% (21.2%) of the area under dN/dz, at z phot > 2 
(zphot < 1). We can examine clustering at these sec- 
ondary Zphot solutions by adopting a similar approach to 
Equation [Cl] 



B 2 UJi = UJi + 2 - (1 - B) 2 U>2 



(2) 



Where, u)%, L02 and u>\+2 are, respectively, the true clus- 
tering at the primary and secondary z p hot solutions, and 
the clustering we measure as a combination of the two 
Zphot solutions. B is the relative contribution of the pri- 
mary and secondary z p hot solutions to dN/dz. We can 
then estimate the true value of the bias at the position 
of the primary solution in z p hot as 



&! +2 < +2 -(l-5) 2 6lo;f 



B 2 i 



,M 



(3) 



where the sup erscript M denotes the model values of u 
(Equation |B2] with 6q = 1), and the bi denote quasar 
bias. Although we don't know the true values of bi and 
62, they can be estimated from measurements of &q. 

If we follow this analysis, our value of &q at z p hot = 
0.75 (zphot = 2.28) should be reduced (increased) by 
at least 10%. The true values of &q must lie beyond 
even these 10% offsets, because we must use our mea- 
sured values of 6g at the secondary solution in z p hot, 
rather than the (unknown) true value, to estimate the 
true value in the primary z p hot bin. If we lower our 
estimates of frjj' 10 * -0 - 75 by 10% and increase our esti- 
mates of b s A hBt=2 ~ s 



by 10%, we find that our data rule 
out a constant 6q at all redshifts with a significance of 
> 99.9999%, dropping to > 99.6% if stellar contamina- 
tion is allowed to freely vary. We note that applying 
Equation [3] to our bins at z p hot = 1.20, 1.53, 1.87 has no 
affect on &q values, as any secondary solutions in z p hot 
have a very small weight (B < 0.05). 

3.3. The Luminosity Evolution of Quasar Clustering 

While the luminosity of UVX quasars depends some- 
what on the mass of the underlying black hole, the 
mechanisms that drive baryons onto the accretion 
disk feeding the black hole are also important. As 
such, models of quasar formatio n and evolution (e.g., 
iHopkins et alll2005al I2005U l2006h can be degenerate be- 
tween mass and luminosity. It is therefore useful to exam- 
ine constraints on quasar bias as a function of luminosity. 
As discussed in Mye06, ideal tests of quasar evolution 
would attempt to break luminosity-redshift degeneracy 
and examine multivariate quasar properties. We now 
repeat our clustering analysis as a bivariate function of 
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Fig. 



Quasar clustering evolution as a bivariate function 



of absolute magnitude, M g , and photometric redshift, 2 p h D t. The 
rows of the nine panels show three bins of roughly equal numbers 



Zphot • 



The columns divide each z. 



phot 



bin into three of equal 



numbers in Mg. There are ~30,000 quasars in each bin. Labeled 
in each panel are the mean g apparent and absolute magnitudes, 
the quasar bias, 6q, derived as in Figure [3] and the \ 2 probabil- 
ity of our best fit model where only 6q is fitted (the dotted line). 
The dashed line is a two-parameter model that incorporates stellar 
contamination as well as quasar bias. Table l2l displays model val- 
ues. Errors in this plot are jackknifed and fits are made over scales 
of 0.16' to 63' using a full covariance matrix. A scale of 0.16' to 
63' is ~55 h^ 1 kpc to 22 h~ 1 Mpc in all bins except the lowest 
redshift bin, where the scales are slightly reduced to ~50 h^ 1 kpc 
to 20 h' 1 Mpc. 



• 2 parameter 

1 parameter 

2 parameter 
> 1 parameter 
1 3 parameter 



85 
2=0.85 

1 44 
2=1.44 

1 92 
2=1.92 



Fig. 6. — Our quasar bias estimates as a bivariate function of 
<;-band absolute magnitude, M g , and photometric redshift, z p hot- 
The mean photometric redshift that corresponds to each shape is 
labeled in the plot. The open shapes show our best fit model when 
only the quasar bias, ?>q, is varied. The solid shapes show estimates 
when a second parameter, the stellar contamination, is also allowed 
to vary (see Table l2l for model values). 
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TABLE 1 

Estimates of the quasar bias, 6q and the quasar host halo mass Momh as a function of photometric redshift 



Zphot 




r = 0.21,0-8 


= 0.9 


T = 0.15,(78 = 0.8 


Zphot 


1 parameter model 


2 parameter model a 


1 parameter model 


range mean 


b Q Mdmh^^Mq) 


a b Q 





0.4,2.3 
0.4, 1.0 C 
1.0,1.4 
1.4,1.7 
1.7,2.1 
2.1,2.8 C 



1.40 
0.75 
1.20 
1.53 
1.87 
2.28 



2.4l±R; 
2.05±» : « 

o 9Q_|_0.I7 
Q.1B 
2.81±»;$ 



12.1 ±H xlO 12 
8.90 ±||i xlO 12 
12.5 ±|f xlO 12 

11.3 4'? x!0 12 
l5:o 

-4.6 



0.956±g 
0.903±2 
0.997±S 
0.884±g 
0.907±S 
0.840±g 



044 

m 
m 

n:!:> 



2A8± UI 

L95 4 

2.06±°| 
2.35±R : || 
3.05±0|| 

q on_|_0.76 



2.57±° 
2.06±; 

2.i6±; 

2.39±; 
3.00±J 

3.i3±: 



5.72 ±g 
4.08 ±i 
5.70 ±J 
4.30 ±| 
5.32 ±0 
5.95 ±S 



fg xlO 12 
P xlO 12 

H 19 

23 xl ° 

| xlO 12 
I xlO 12 



a With stellar contamination 1 — a. 

10 Due to catastrophic failures, values of bQ in this row need lowered at least 10%. The derived Mdmh use the reduced values. See 
Equation [3] and the associated discussion. 

c Due to catastrophic failures, values of 6q in this row need raised at least 10%. The derived M^mh use the increased values. See 
Equation [3] and the associated discussion. 



redshift and luminosity. We derive g-band absolute mag- 
nitudes (M g ) for KDE objects by assuming that each 
photoz is a reasonable ensemble est imate of r e dshift . We 
incorporate the K-correction from iWisotzki (|2000t see, 
e.g., Mye06). Consistent M g binning at every redshift 
is impractical for quasars (which span ^8 magnitudes in 
M g ), so, as in Mye06, we split the KDE sample into three 
photometric redshift bins, then subdivide these into three 
M g bins. We then measure the autocorrelation of each 
of these nine subsamples. 

In Figure [5] we plot the bivariate quasar autocorre- 
lation as a function of photometric redshift and abso- 
lute magnitude. As before, we derive the quasar bias 
over the range 0.16' to 63', plotting the results in Fig- 
ure [6] (see also Table [2|). Our results are at least twice 
as precise as the equivalent results from Mye06, but 
other than the general increase with redshift discussed 
in Section 13. 2\ there appears to be no discernible trend 
in quasar clustering with absolute magnitude. How- 
ever, a model with constant quasar bias as a function 
of absolute magnitude is only just accepted by our data 
(b Q = 2.50 ± 0.11, P <X 2 = 0.092). If we instead take 
6q values from models that allow a stellar contamina- 
tion component, a constant &q model is more acceptable 
(b Q = 2.55 ± 0.21, P< X 2 = 0.662). 

To test for luminosity-dependent bias we combine mea- 
surements for each set of two magnitudes within each 
redshift bin plotted in Figure [6] into a single inverse- 
variance-weighted estimate and determine whether this 
estimate would be rejected by the measurement in the 
third magnitude bin. Quoting the maximum rejection 
in each case, we find: (1) Incorporating stellar contami- 
nation, a maximum rejection of 0.2ct for z = 0.85, 0.2cr 
for z = 1.44 and l.Ocr for z — 1.92, and; (2) ignoring 
stellar contamination, a maximum rejection of O.lcr for 
z = 0.85, 0.5ct for z = 1.44 and 1.5ct for z = 1.92. 

4. DISCUSSION 

4.1. Quasar Host Masses 

Follow ing Cro05 we can use the ellipsoidal collapse 
model of lSheth. Mo fc Tormenl ((2001 to convert quasar 
biases into masses for the halos hosting UVX quasars. 
We weight this model across our (normalized) redshift 
distributions 



bQ(M DM H, z) = 1 + 



dz 



dN 



y/a(ai/ 2 ) + \/ab(av 2 ) 



2\l-c 



dz y/aS sc (z) 
{av 2 ) c 



(4) 



(cu/ 2 ) c + 6(1 -c)(l-c/2) 



where a = 0.707, b = 0.5, c = 0.6 and 5 sc (z), the crit- 
ical density contrast for spherical collapse, is given by 
iNavarro. Frenk fc White! (fl997n as 0.15(127r) 2 / 3 ^° 055 
(for flat cosmologies), where £l mz = 0, m (z) is given, e.g., 
in Mye06. 

Masses can be derived via v = 6 sc (z)/a r (M, z), where 
a r (M,z) =a r (M)D(z), and D(z) is the linear growth 
factor, which we appr o ximat e using the formula of 
ICarroll. Press fc Turned (|1992t see, e.g., Mye06). The 
mass variance for a halo, o 2 (Mumh) can be determined 
from the radius of a halo of mean mass Mdmh 



_ ( 3Mdmh 



1/3 



(5) 



where p Q = 2.78 x lO n f2 m /i 2 M Mpc" 3 is the present 
mean density of the universe. This mass scale implies a 
mass variance of 



k 3 p(k) 



3ji(kr) 



1 2 



kr 



T (6) 



where the term in square brackets represents a spherical 
top hat smoothing for the density field (Ji is the spherical 
Bessel function of first order). We set V so that as is tied 
to observations when r = 8 h^ 1 Mpc. 

We assume our bias values are valid in the linear 
regime (as our analysis suggests 6q is scale-independent 
over at least 0.055-22 hr 1 Mpc), and adopt a form for 
the (adiabatic, CDM) linear power spectrum of P(k) = 
T 2 (k)k n with n = 1 (the Harrison- Zcldovich-Peebles 
scale-invariant case). We adopt the t ransfer function, 
T(k) = T (q) given by Equation 29 of lETsenstein fc Hul 
(11998D. who, following | Bardeen et al.l ( 19861 : see also 
lEfstathiou. Bond fc Wh ite 1992), showed that the trans- 
fer function can be characterized by its shape (r) via 



Clustering of 300,000 Photo-Classified QSOs 



7 



q = 



hr 1 Mpc 



©l. 7 /r 



(7) 



for a CMB temperature parameterized as 2.782.7K. In 
pure CDM, Y = Q m h; however, baryon s affect the powe r 
spectrum shape, and T — > r o g (e.g., ISugivamal Il995[ ). 
Note that, as we analyze scales far smaller than the 
sound horizon, Y e g can be derived from the baryon frac- 
tion via r e g = a r £l m h with a r given by Equation 31 of 
lEisenstein feHul (fl998l ). 

Throughout this paper, we have used a concordance 
cosmological model with erg = 0.9 and T — 0.21, as 
quasar bias is not greatly affected by complementar- 
ily altering a% and T. However, the conversion from 
bq to Mdmh is somewhat dependent on erg and T. 
As such, we also now analyze our results in the con- 
text of a cosmological model with (ri m , ^a, ct§, T, 
h)= (0.28,0.72,0.8,0.15,0.7), motivated by recent su- 
perno yae, large-scale structure, and CMB measurements 
(e.g.. iRiess et al.1 l2004t ICole et all 120051: iSpereel et all 
12009) . Note that Equation 31 of lEisenstein & Hul (|1998l ) 
suggests that Y — a r Sl m h = 0.15 is close to adopting 
a (realistic) bar yon fraction of 0(,/fi m = 0.185 (e.g., 
ICole et alll2005h . 

In Table [1] we display our derived values for the mass 
of the dark matter halos hosting quasars. The values for 
Mdmh in our highest (lowest) redshift bins have been 
calculated based on raising (lowering) &q by 10% (see 
Equation [3]) . A Spearman rank test shows no signifi- 
cant correlation between redshift and Mdmh', however 
such a test is not compelling for only 5 redshift bins, 
particularly as we don't necessarily trust our 6q correc- 
tions at low redshift (as these corrections are dependent 
on the less precise values of 6q derived in our highest 
redshift bin). We will therefore assume, as detected in 
Cro05, that Mdmh is constant with redshift. Across 
all our individual redshift bins we obtain a weighted 
mean of M DM h = 4.8 ± 0.5 x 10 12 b. -1 M . If we in- 
stead only consider our "best" bins, in the range 1.0 < 
z < 2.1, we obtain M DMH = 5.2 ± 0.6 x 10 12 h _1 M . 
This value of Mdmh deviates ~1.3cr from the value of 
Mdmh = 3.0 ± 1.6 x 10 12 h -1 Af obtained by Cro05 
using a similar cosmology, and is slightly below the de- 
termination of M DM h ~ 10 13 M W from PMN04 (see also 
iPorciani fc Norberdl200a) . 

4.2. Luminosity-Dependent Quasar Bias 

Although, in section 13.31 we detected no significant 
luminosity dependence to quasar bias in any redshift 
bin, our detection of 1.5er in our highest redshift bin 
(z = 1.92) is close to being significant. We can esti- 
mate the factor by which our data sample would have to 
increase in size before luminosity-dependent bias could 
be detected using our methodology. Assuming that the 
noise reduction scales as the square root of the sample 
size, a 2a detection in our z — 1.92 bin would require a 
sample 3.8 times larger (including stellar contamination) 
or 1.8 times larger (ignoring stellar contamination). Sim- 
ilarly, a 3cr detection would require a sample 8.6 or 4.0 
times larger, respectively. 

A sample size twice as large as that used in this paper 
should be achievable in the near future. The necessary 
sample size to detect luminosity-dependent biasing will 



TABLE 2 

Bivariate Estimates of the QSO bias, 6q, as a function of 

PHOTOMETRIC REDSHIFT Z phot AND ABSOLUTE MAGNITUDE M g 





1 parameter model 


2 parameter model a 


Zphot 




a b Q 



0.4 < z phot < 1.2 

0.85 -23.99 
0.85 -22.82 
0.85 -21.69 



2.03±; 
2.00±; 
2.08±; 



8:3? 
n.37 



0.877± 
1.000± 
0.999± 



0.102 

m 

0-141 



1.89± 

2.oo±: 

2.08±: 



54 



1.2 < z phot < 1.65 

1.44 -24.84 
1.44 -23.81 
1.44 -23.25 



2.86±°|| 
2.62±F f 



2.68±! 



1 nnn _i_o.ooo 

l.UUU^„ 1 Q3 
0.920±S'™ 



0.878± 



2.86±2 
2.73±g 
2.87±S 



1.65 < z phot < 2.3 
1.92 -25.42 
1.92 -24.46 
1.92 -23.92 



2.77±°Jf 

o ok_i_0.45 

3.18±B;» 



0.865± 
0.947± 
0.869± 



0.109 

m 
m 

o.or,4 



2.87±; 
2.36± 
3.46±: 



a With stellar contamination 1 — a. 

be further reduced by improved photometric techniques. 
For example, although we attempt to restrict the range 
of photometric redshift over which we analyze bivariate 
quasar clustering to reduce the effect of catastrophic fail- 
ures on our bQ estimates, some quasars in our luminos- 
ity analysis will still be placed in entirely the wrong bin 
of M g , diluting the significance of any comparisons we 
make between M g bins. Finally, we note that it is highly 
unlikely that luminosity-dependent quasar bias can ever 
be detected, via our angular analysis, to magnitudes of 
g < 21 at redshifts z < 1.6. However, our let errors sug- 
gest that at z < 1.6 quasar bias changes with luminosity 
by less than ±0.6 (±0.3 if we ignore stellar contamina- 
tion) . 

5. SUMMARY AND FUTURE PROSPECTS 

In this paper, we used a sample of ^300,000 photomet- 
rically classified quasars drawn from SDSS DR4 to study 
the evolution of quasar clustering. Our main results are: 

(a) Over scales of 0.16' to 100' (-55 h- 1 kpc to 
35 h^ 1 Mpc at our sample's mean redshift of 
z — 1.4) quasar clustering is well-described by 
Smi03 fits to dark matter clustering in simula- 
tions, a ACDM cosmology and a single quasar bias 
parameter. Quasar biasing appears to be scale- 
independent over this range. 



(b) For er 8 = 
bias is b, 



Q 



0.9 and T 

:1A0 = 2.41 ± 0.09, rising to bf= 1A0 



0.21 the required quasar 



2.57 ±0.10 for cr 8 = 0.8 and Y = 0.15. 

(c) Our sample alone is sufficient to rule out a con- 
stant quasar bias over 0.75 < z < 2.28 (for scales 
of —55 hr 1 kpc to 22 hr 1 Mpc) at, conservatively, 
> 99.6%. This significance rises to > 99.9999% 
if stellar contamination is not explicitly fit, which 
is likely closer to the true significance of our de- 
tection (see section 13. ip . At z — 2.3 we find 
6q — 3. Considering complementary independent 
constraints on redshift evolution of 99.8% (Cro05) 
and 3.6a (PMN04) from the 2QZ, it is certain that 
quasar bias is therefore evolving with cosmic time. 



Myers et al. 



(d) Using our best photometric redshift ranges, 
<78=0.8, and T = 0.15, we find a mean mass for 
the dark matter halos hosting UVX quasars of 
M DM h = 5.2 ± 0.6 x 10 12 h _1 M Q , approximately 
halfway between the values of Mdmh determined 
from the 2QZ by PMN04 and Cro05. 

(e) We find no significant luminosity dependence to 
quasar clustering, but our analysis hints at a small 
dependence (1.5a) at high redshift (z = 1.92). This 
suggests that, with improved photometric classifi- 
cation efficiency, a sample size of as little as 1.8 
times larger (^550,000 objects) may be sufficient to 
detect luminosity dependence in quasar clustering 
at z ~ 2. This might distinguish "light bulb" ac- 
cretion (where quasars are either "off" or acc rete at 
one efficiency; see, e.g.. iValageas et aT]|2001|) . from 
mode ls that allow a r ange of accretion efficiencies 
fe.g.. lLidz et al.ll2006t) . 

(f) Our work ag r ees ex cellently with local results from 
iSerber et al . (2006), who studied the environments 
of quasars at z < 0.4 in DR3 (see their Fig- 
ure 2). We concur that there is little luminosity 
dependence to quasar clustering on proper scales 
of > 50 h^ 1 kpc (their 100 hjQ kpc comoving), 
and, also, that any weak luminosity trend is only 
expressed for brig hter quasars (M Q < —24; note, for 
comparison with lSerber et al.l l2006. that g-i ~ 0.3 
for UVX quasars). 

(g) It is highly unlikely that our technique will con- 
strain any luminosity dependence to quasar clus- 
tering to magnitudes of g < 21 at rcdshifts z < 
1.6. The errors on our measurements suggest that 
quasar bias bn is constant at z < 1.6 to <±0.6. 



Although our analyses in section 13731 uncovered no sig- 
nificant pattern, they were close to favoring particular 
trends. In the near future, the prospects for reassessing 
these results are excellent. Analysis of the angular clus- 
tering of photometrically classified quasars will improve 
not only as photometric surveys widen and deepen, in- 
creasing total numbers of objects, but also as classifica- 
tion efficiency improves, and is expanded to non-UVX 
Active Galactic Nuclei and to higher redshift. Thus, 
we expect the statistical power of clustering analyses 
of photometrically classified quasars to rapidly improve. 



Further, estimates of the evolution of quasar clustering 
will be enhanced as quasar photometric redshift esti- 
mates tighten a nd catastrophic estimates diminish (e.g., 
iBall et all I2007D . In combination we expect these fac- 
tors to eventually allow significant constraints on the lu- 
minosity evolution of quasar clustering, particularly at 
z > 1.6. 
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APPENDIX 

APPENDIX A. BACKGROUND METHODOLOGY 
CORRELATION FUNCTION AND MODEL FITTING 

We estimate the two-point angular correlation function (lo) via (|Landv fc Szalavj |T993f ) 

QQ(9) - 2QR(6) 

wW = me) + 



(Al) 



from counts of quasar-quasar (QQ), quasar-random (QR) and random-random (RR) pairs. We use a random catalog 
100 times larger than the data catalog. The random catalog is constructed using masks from the SDSS DR4 Catalog 
Archive Server and cutting both the KDE data and the random catalog to the SDSS DR4 theoretical footprint (which 
discards ~1.7% of the data). Our approach is detailed in Mye06, where we also discuss how points in our random 
catalog are assigned values of seeing and Galactic absorption. 

We estimate errors and covariance matr ices using inverse- variance- weighted jackknife resampling ([Scranton et al.l 
l2002t IZehavi et al.l 120(53 : iMvers et al.l l2005h . The jackknife method is to divide the data into N pixels, then create 
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TV subsamples by neglecting each pixel in turn. Note that if we considered the contribution of each pixel, rather 
than neglecting its contribution, this would be an inverse- variance- weighted pixel-to-pixel (also called field-to-field or 
subsampled) error estimate (e.g.. iMvers et al1l2003f ). We will generally refer to the chosen length for each side of the 
pixels as the jackknife resolution. If we denote subsamples by the subscript L and recalculate w^, in each jackknife 
realization via Equation I Al( then the inverse- variance- weighted covariance matrix (Cy) can be generated as 




O, = Otf,,tf, i = > ; J ^# [u^) - U (9i)] J^^y [u-LiHi)- ( A2) 



where, ui denotes the correlation function for all data and ojl denotes the correlation function for subsample L. 
Jackknife errors o~i are obtained from the diagonal elements (of = Co), and the normalized covariance matrix, also 
known as the regression matrix, is 

|C| = °*L (A3) 

OiOj 

The RRl I RR terms in Equation |A"2l (|Mvers et al.ll2005t ) weight by the different numbers of objects expected, due to 
holes, poor seeing, pixels that extend beyond the survey boundary, etc. We then estimate % 2 fits to model angular 
autocorrelation functions (u) m ) using the inverse of the covariance matrix, and determine errors on fits from A% 2 , 
where 

X 2 = J2 [<"(*«) - Cr/ [ U (0j) - u m (9j)} (A4) 

i,j 

The simplest models we fit are power-laws of the form ui m (8) = A9~ 5 , where the units of A (as we fit it) are arcmin* 5 . 
In general we fit the more physical models discussed in Appendix [B] 

MODELING PROJECTED QUASAR CLUSTERING 

Since the seminal scaling relations of lHamilton et al.l ([1991), many authors (e.g., iPeacock fc Doddsl Il994t 
IJain. Mo fc White! H995L IPeacockfcj5oddl 119961) have worked to obtain precise analytical descriptions of the clus- 
tering of dark m atter particles. iSmith et al.l (|2003l ; henceforth Smi03) married traditional approaches with a simple 
halo model (e.g., ICoorav fc Shethll2002f ) toobtain fitting formulae that better approximate the non-linear clustering 
behavior of simulated CDM. The Smi03 formulae reproduce clustering in dark matter simulations to better than 3% 
at (redshift) z < 3 for (wavenumber) k < 10 h Mpc -1 . 

The models of Smi03 directly predict the non-linear, dimensionless power spectrum of dark matter Aj5j L (£;, z) for a 
wide range of CDM cosmologies. The clustering of objects that formed in the rare peaks of a Gau ssian random field 
is expected to be biased relative to underlying dark matter (e.g., Kaiser 1984; Bardccn et al. 1986), in a manner that 
could depend on both scale and formation history. Thus the clustering of quasars relative to dark matter might be 
modeled as Aq(fc, z) = bg(k, z)A^ L (k, z), where 6q is the quasar bias. 

In this paper, we measure angular autocorrelations. For small angles (9 <C 1 radian), Limber's equation can be 
used to project the powe r spectrum into the angular autocorrelation (jLimberl 119531 : iPeeblesI 119801 : IPeacock! 119911 : 
iBaugh fc Efstathioul ll993) via 

where Jo is the zeroth-order Bessel function of the first kind, x is the radial comoving distance, dN/dz is the redshift 
selection function (normalized so that J °° [dJV/dzJdz = 1), and dz/dx = H z /c = H a ^Q m (l + z) 3 + CIa/c. Strictly, x 
should be the angular, or transverse, comoving distance; however, in a flat cosmology, radial and transverse comoving 
distances are equivalent, and the curvature term vanishes — F(x) — 1- We ultimately, model the angular quasar 
autocorrelation function as 

"QQ(g) = ^f f [ ^(M) AgjL f ' Z) Jo[k9 X (z)] (^) 2 VV m (l + zr + n A fdz (B2) 

In theory, with sufficient data, 6q(A:, z) may be directly constrained by loqq, although we will generally set &q to be a 
constant and constrain it by comparing the amplitudes of luqq and the projected matter power spectrum as a function 
of redshift and scale. 

In general, A^ L is not separable into individu al fu nctions of k and z. We therefore Monte Carlo integrate under the 
surface described by the integrand in Equation IB2I until the integration is evaluated to better than 1%. To optimize 
this process, two points are worth noting. First, the change of variables dk/k — ln(10)dlog(fc) allows more uniform 
sampling in fc-space. Second, although A^j L is not easily separated, we have determined that the "parameters of the 
spectrum" (k^ 1 , n e ff and C; see Appendix C of Smi03) can be approximated by splines to < 0.3% for at least z < 4 
in a ACDM cosmology. Figure [7] demonstrates a typical surface we might integrate under to obtain loqq- On scales 
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Fig. 7. — One example of a surface that we Monte Carlo integrate under to project the matter power spectrum into the angular 
autocorrelation function. The surface is plotted for 9 = 1', and <5 2 wjvi, = (^nl/^ 2 ) Jo[k8x(z)] (diV/dz) 2 dz/dx (see Equations IB1I and 
IB2I I. We obtain dN/dz from spectroscopic matches (with DR1QSO, DR2 or the 2QZ) to our photometrically classified sample. Two 
individual contributions to power are apparent at 1'; the non-linear matter spectrum at low k and a halo term at high k (see Smi03). 



< 30' such surfaces are fairly smooth and the Monte Carlo integrations rapidly converge. Our integrations remain 
tractable at the 1% level out to the largest scales we model (< 100'). 

APPENDIX B. POTENTIAL SYSTEM ATICS 
STELLAR CONTAMINATION 

We have addressed sources of systematic error in some depth in Mye06. In particular, we noted that the autocorrela- 
tion of KDE objects, to, combines clustering signals from a stellar component loss and the true quasar autocorrelation 
loqq. If a is the efficiency, the fraction of genuine quasars that are classified as such by the KDE technique, then 

w(0) - a 2 Lv QQ (8) + (1 - a) 2 LV S s(0) + e(0) (CI) 

where e is a tiny (theoretically zero) offset arising from QS, QR and SR cross-terms. 

If the efficiency of the KDE technique is high (i.e., a — > 1), stellar contamination is only important as ojqq — * 0. In 
Section 13.11 we fit the two-parameter model defined by Equations IB2I and IC1I to the DR4 KDE autocorrelation and 
estimate the stellar contamination (1 — a). In doing so, we derive a — 0.956±o'gfg, consistent with Mye06 (and with 
Richards ct al. 20Q4|), and find that our best analysis of the quasar bias is therefore at angles of 9 < 1°. 
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Fig. 8. — Error on the correlation function, a u at different jackknife resolutions. The error appears to be poorly defined, or flatten, 
around scales corresponding to the resolution. Generally, this is the same scale at which a pixel-to-pixel estimate of the error would break 
down. The data used to estimate was the DR4 KDE sample discussed in Section [2] with an initial dust cut A g < 0.24. 

M1SCLASSIFIED H II REGIONS 

In Paper2, we discuss non-stellar misclassified objects in the KDE catalog (generally H II regions in various galaxies). 
The fraction of such objects in the KDE sample is too small to affect clustering measurements, except on small scales 
where H II regions can mimic quasar pairs, and is thus generally negligible on our scales of interest. For instance, by 
visually inspecting pairs of KDE objects, we estimate that on scales of 12" (the smallest scale fit in this paper), the 
effect amounts to ~1.0<r (where a is the error on w), by scales of 30", the effect is < OAa, and on larger scales, where 
we exceed the observed angular size of most galaxies, the effect vanishes. We are engaged in determining the regions 
that need masked from KDE clustering analyses because of this small effect but (unlike in Paper2 where our focus is 
small scales) we make no attempt to correct for the effect in this paper. 

COVARIANCE AND THE JACKKNIFE RESOLUTION 

If the covariance between scales is perfectly accounted for, Equation IA4I should return identical estimates of \ 2 
irrespective of the jackknife resolution. Given that DR4 covers close to 7000 square degrees, we have adequate area 
with which to determine whether the jackknife resolution affects significance estimates. In Figures |8] and |9] we plot 
jackknife errors and covariance matrices for identical sets of data, obtained at different jackknife resolutions. 

Figure [5] clearly shows that error estimates arc influenced by the choice of jackknife resolution, and inflate on 
scales similar to the resolution. However, we might expect that error discrepancies will be offset by differences in 
the covariance matrices plotted in Figure [5J To test this we assume a model that, at every scale, is equal to w + a u 
(the tip of the la upper error bar) as obtained when jackknife resampling at 10°, and calculate x 2 f° r other jackknife 




6 (arcmin) 9 (arcmin) 

Fig. 9. — Normalized covariance matrix (see Eguation lA3|l at different jackknife resolutions. The covariance grows rapidly on scales larger 
than the jackknife resolution. On scales far smaller than the jackknife resolution, off-diagonal elements are well-mixed but still contribute 
to significance estimates. From top-left to bottom-right the panels represent jackknife resolutions of 0.3°, 1°,3° and 10°. The data used to 
estimate the covariance matrices was the DR4 KDE sample discussed in Section [2] with a nominal A g < 0.24 Galactic absorption cut. 



resolutions (over the 0.1' to 100' scales we study in this work). We find that jackknife resolutions of greater than a few 
degrees all return highly consistent x 2 estimates (within ~2%) but jackknife resolutions that lie increasingly within 
our scales of interest give increasingly inflated estimates of the significance relative to our 10° control. 

In this paper, we will use a jackknife resolution of 10°, for several reasons. The covariance matrix is well-mixed at 
this resolution, so that either neglecting or incorporating covariance gives similar y 2 estimates (to within 5%). Further, 
at a resolution of 10° pixel-to-pixel errors can be calculated (see Appendix [X| for our scales of interest, and we find 
these agree with the jackknife estimates. Note that generalizing the ideal jackknife resolution for a given analysis is not 
our focus in this work; we simply suggest that when jackknifing errors, different jackknife resolutions should be tested, 
particularly when the resolution is similar to the scales being probed. It is tempting to ask, however; if pixel-to-pixel 
errors are always well-defined for resolutions that consistently recover significance estimates, why use the jackknife at 
all? 

ADDITIONAL OBSERVATIONAL SYSTEM ATICS 

Mye06 discussed how observations that were made in poor seeing conditions or that trace absorption by dust in our 
Galaxy; could contaminate the true quasar clustering signal. Using a similar analysis to Mye06, with the narrower 
binning allowed by the larger DR4 data set, we find: (1) No clear pattern imposed by seeing variations; and (2) that 
an A g < 0.21 cut on our DR4 KDE sample (and random catalog) removes clustering imprints caused by Galactic 
dust. In agre e ment with Mye06, we find a clear clustering pattern in the KDE sample for A g > 0.22. We note that 
lYahata et alj (|2006[ ) found little change in the number density of KDE objects as a function of Galactic absorption, 
perhaps because their surface density analysis is insensitive to the relative numbers of stars and quasars assigned by 
the KDE technique (see expanded discussion in Mye06). We have checked that a range of cuts around A g < 0.21 
(in particular, our adopted cut of A g < 0.18 from Mye06) all yield statistically similar estimates of u>. For our main 
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analyses in this paper, we adopt no seeing cut and an A g < 0.21 cut, which discards ~12.5% of our sampled area. 
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